Temporal Envelope and Fine Structure Cues for Dysarthric Speech Detection Using CNNs

نویسندگان

چکیده

Deep learning-based techniques for automatic dysarthric speech detection have recently attracted interest in the research community. State-of-the-art typically learn neurotypical and discriminative representations by processing time-frequency input such as magnitude spectrum of short-time Fourier transform (STFT). Although these are expected to leverage perceptual cues, STFT do not necessarily convey aspects complex sounds. Inspired temporal mechanisms human auditory system, this paper we factor signals into product a slowly varying envelope rapidly fine structure. Separately exploiting different cues present (i.e., phonetic information, stress, voicing) structure pitch, vowel quality, breathiness), two learned through convolutional neural network used detection. Experimental results show that both yields considerably better performance than only envelope, structure, or representation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Consonant identification using temporal fine structure and recovered envelope cues.

The contribution of recovered envelopes (RENVs) to the utilization of temporal-fine structure (TFS) speech cues was examined in normal-hearing listeners. Consonant identification experiments used speech stimuli processed to present TFS or RENV cues. Experiment 1 examined the effects of exposure and presentation order using 16-band TFS speech and 40-band RENV speech recovered from 16-band TFS sp...

متن کامل

Objective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues

While temporal envelope and fine-structure cues are known to be good predictors for speech intelligibility, it is not clear how well they are correlated with subjective quality ratings, particularly those using noise-suppressed speech. The present work evaluated the performance of two objective measures (i.e., NCM and TFSS), which were originally developed with primarily envelope or fine-struct...

متن کامل

The Role of Temporal Fine Structure Cues in Speech Perception

In this thesis, the importance of temporal fine structure (TFS) in speech perception is investigated. It is well accepted that TFS is important for sound localization and pitch perception, while envelope (ENV) is primarily responsible for speech perception. Recently, a significant contribution of TFS in speech perception has been suggested. This was linked to the improved ability of normal-hear...

متن کامل

Detection of speech landmarks using temporal cues

In order to improve the performance of speech recognizers, particularly in degraded environments, it may be bene cial to integrate use of temporal information. As literature has shown that human listeners are able to use temporal cues in speech recognition tasks, this study examines algorithms for extraction of temporal cues in a speech signal. The task under analysis is the location of landmar...

متن کامل

The role of recovered envelope cues in the identification of temporal-fine-structure speech for hearing-impaired listeners.

Narrowband speech can be separated into fast temporal cues [temporal fine structure (TFS)], and slow amplitude modulations (envelope). Speech processed to contain only TFS leads to envelope recovery through cochlear filtering, which has been suggested to account for TFS-speech intelligibility for normal-hearing listeners. Hearing-impaired listeners have deficits with TFS-speech identification, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Signal Processing Letters

سال: 2021

ISSN: ['1558-2361', '1070-9908']

DOI: https://doi.org/10.1109/lsp.2021.3108509